Abstract

We discuss different statistical distances in probability space, with emphasis on the
Jensen-Shannon divergence, vis-a-vis metrics in Hilbert space
and their relationship with Fisher's information measure. This study provides 
further reconfirmation of Wootters' hypothesis concerning the possibility that 
statistical fluctuations in the outcomes of measurements be regarded (at least 
partly) as responsible for the Hilbert-space structure of quantum mechanics. 
I. INTRODUCTION

Wootters [1] has shown that nature defines the distance $D_W$ between two quantum
states, say, $\psi_1$ and $\psi_2$, by counting the number of distinguishable intermediate states, which
establishes a link between statistics (distances in probability spaces, determined by the
size of statistical fluctuations [1]) and geometry (metrics in Hilbert space) that has been
considerably strengthened in [2]. The claim is made, however, that a proper understanding
of this link needs still further elaboration [1], raising the interesting possibility that statistical
fluctuations in the outcomes of measurements might be partly responsible for the
Hilbert-space structure of quantum mechanics. An illuminating expansion of the work of [1] is
provided in [2] and some footnotes can be found in [3]. Quite lucid, insightful, and also
didactic mathematical treatments pertaining to the field of information geometry are those
of Refs. [4-7] (see also references therein).

Obviously, the concept of distance between different rays in (the same) Hilbert space 
plays an important role in a variety of circumstances, like different preparations of the same 
system [1] or the geometric properties of the quantum evolution sub-manifold [8]. It is 
also relevant, for instance, in discussing squeezed coherent states, displaced number states, 
generalized coherent spin states, etc. [8], or for ascertaining the quality of approximate 
treatments [3]. One encounters it as well in connection with the detection of weak signals.

Inspired by the pioneering effort of Wootters [1], a set of well-known Hilbert-space metrics
was compared, in Ref. [3], to that underlying Wootters' $D_W$. Let a quantum state $\psi_\alpha(x)$
be parameterized by $n$ real parameters collectively denoted by the symbol $\alpha$. The set of rays
corresponding to the states with all possible $\alpha$-values constitutes an $n$-dimensional manifold
$\mathcal{K}$ of the Hilbert space $\mathcal{H}$. Set $\psi_1 = \psi_\alpha(x)$ and $\psi_2 = \psi_{\alpha+\Delta\alpha}(x)$. Expanding now $\psi_2$ in a
$\Delta\alpha$-series, it was found in [3] that, up to second order in $\Delta\alpha$, several Hilbert-space distances
between $\psi_2$ and $\psi_1$ coincide with Wootters' $D_W$, as does Kullback's cross-entropy
[9] between $|\psi_2|^2$ and $|\psi_1|^2$.

Here we will pursue the work of [3] i) by incorporating additional measures like the
Jensen-Shannon divergence and, more importantly, ii) by providing still another interpretation
of the role of fluctuations on which Wootters' work originally focused attention.
II. FISHER'S INFORMATION MEASURE



An information measure can primarily be viewed as a quantity that characterizes a
given probability distribution. Shannon's logarithmic information measure

$$S[P] = -\sum_{j=1}^{N} p_j \ln(p_j), \tag{1}$$

is regarded as the measure of the uncertainty associated with probabilistic physical processes
described by the probability distribution $\{p_j,\ j = 1, \ldots, N\}$ ($P = (p_1, p_2, \ldots, p_N)$ is the
probability vector in the probability space $\Omega \subset \mathbb{R}^N$). We will be concerned here mainly with
another important information-theoretic measure: that of Fisher ($I$) [12,13], advanced by
R. A. Fisher in the twenties.

$I$ has been the subject of much work lately (a detailed study can be found in
references [12,13]). Let us consider a system that is specified by a physical parameter $\alpha$, while $x$
is a stochastic variable ($x \in \mathbb{R}^N$) and $P_\alpha(x)$ the probability density for $x$, which depends
on the parameter $\alpha$. An observer makes a measurement of $x$ and has to best infer $\alpha$ from
this measurement, calling the resulting estimate $\tilde\alpha = \tilde\alpha(x)$. One wonders how well $\alpha$ can
be determined. Estimation theory [14] asserts that the best possible estimator $\tilde\alpha(x)$, after a
very large number of $x$-samples is examined, suffers a mean-square error $e^2$ for $\alpha$ that obeys
a relationship involving Fisher's $I$, namely, $I e^2 = 1$, where the Fisher information measure
$I$ is of the form

$$I(\alpha) = \int dx\, P_\alpha(x) \left[\frac{\partial \ln P_\alpha(x)}{\partial \alpha}\right]^2. \tag{2}$$

This "best" estimator is called the efficient estimator. Any other estimator must have
a larger mean-square error. The only proviso to the above result is that all estimators be
unbiased, i.e., satisfy $\langle\tilde\alpha(x)\rangle = \alpha$. Thus, the inverse of Fisher's information measure provides
a lower bound for the mean-square error associated with the statistical inference of the
parameter $\alpha$. No matter what estimator we use (as long as it is an unbiased estimator) we
have $e^2 \geq 1/I$. This inequality is referred to as the Cramér-Rao bound [13].
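As a minimal numerical sketch of Eq. (2) and of the Cramér-Rao bound, assume (purely for illustration; the text does not single out any distribution) that $P_\alpha(x)$ is a Gaussian of mean $\alpha$ and width $\sigma$, for which $I = 1/\sigma^2$. The function names, the integration grid, and the sample sizes are our own choices. The sketch also uses the standard fact that Fisher information is additive over $n$ independent measurements, so the bound for the sample mean is $1/(nI)$.

```python
import math
import random

def gaussian_pdf(x, alpha, sigma):
    """Density P_alpha(x): Gaussian with mean alpha and width sigma (assumed family)."""
    z = (x - alpha) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))

def fisher_information(pdf, alpha, lo, hi, n_grid=4000, h=1e-4):
    """Approximate Eq. (2), I = int dx P (d ln P / d alpha)^2, by the midpoint rule.

    The alpha-derivative of ln P is taken by central finite differences (step h).
    """
    dx = (hi - lo) / n_grid
    total = 0.0
    for k in range(n_grid):
        x = lo + (k + 0.5) * dx
        dlnp = (math.log(pdf(x, alpha + h)) - math.log(pdf(x, alpha - h))) / (2.0 * h)
        total += pdf(x, alpha) * dlnp * dlnp * dx
    return total

def sample_mean_mse(alpha, sigma, n_samples, n_trials, seed=0):
    """Monte Carlo mean-square error of the sample-mean estimate of alpha."""
    rng = random.Random(seed)
    sq_err = 0.0
    for _ in range(n_trials):
        est = sum(rng.gauss(alpha, sigma) for _ in range(n_samples)) / n_samples
        sq_err += (est - alpha) ** 2
    return sq_err / n_trials

alpha, sigma = 1.0, 0.5      # illustrative values
n_samples = 20               # measurements per estimate

fisher = fisher_information(
    lambda x, a: gaussian_pdf(x, a, sigma),
    alpha, alpha - 10.0 * sigma, alpha + 10.0 * sigma,
)
bound = 1.0 / (n_samples * fisher)   # Cramer-Rao bound for n independent samples
mse = sample_mean_mse(alpha, sigma, n_samples, n_trials=4000)

print("Fisher information (numerical):", fisher)
print("Cramer-Rao bound 1/(n I):      ", bound)
print("sample-mean MSE (Monte Carlo): ", mse)
```

For this Gaussian family the sample mean is an efficient estimator, so the simulated mean-square error should sit close to the bound rather than well above it.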
III. THE JENSEN-SHANNON DIVERGENCE



We review now some other measures with which we will be concerned in the present
work and that were not taken into account in [3]. Let $P_{(k)} \in \Omega \subset \mathbb{R}^N$, with $k = 1, 2$,
denote two different probability distributions for a particular set of $N$ accessible states. The
components of the two probability vectors must satisfy the following two constraints: a)
$\sum_{j=1}^{N} p_j^{(k)} = 1$ and b) $0 \leq p_j^{(k)} \leq 1\ \forall j$. The set $\Omega$ defined by these constraints is the simplex
$S_N$, which is a convex $(N-1)$-dimensional subset of $\mathbb{R}^N$. A quite important
information-theoretic divergence measure between $P_{(1)}$ and $P_{(2)}$ was originally introduced by Lin
[15]. It came afterwards to be called the Jensen-Shannon divergence (JSD) [16,17], and it

• induces a true metric in $\Omega \subset \mathbb{R}^N$, being indeed the square of a metric [18], and

• is intimately related to the Kullback-Leibler relative entropy $K$ for two probability
distributions $P_{(1)}$ and $P_{(2)}$, given by [9]

$$K[P_{(1)}|P_{(2)}] = \sum_{j=1}^{N} p_j^{(1)} \ln\left(p_j^{(1)}/p_j^{(2)}\right). \tag{3}$$



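We first define

$$J_0[P_{(1)}, P_{(2)}] = K\left[P_{(1)} \,\Big|\, \frac{P_{(1)} + P_{(2)}}{2}\right], \tag{4}$$

and then the symmetric quantity

$$J_1[P_{(1)}, P_{(2)}] = J_0[P_{(1)}, P_{(2)}] + J_0[P_{(2)}, P_{(1)}]
 = 2S\left[\tfrac{1}{2}P_{(1)} + \tfrac{1}{2}P_{(2)}\right] - S[P_{(1)}] - S[P_{(2)}]. \tag{5}$$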

Let now $\pi_1, \pi_2 > 0$; $\pi_1 + \pi_2 = 1$ be the "weights" of, respectively, the probability
distributions $P_{(1)}$, $P_{(2)}$. The JSD reads

$$J^{\pi_1,\pi_2}[P_{(1)}, P_{(2)}] = S[\pi_1 P_{(1)} + \pi_2 P_{(2)}] - \pi_1 S[P_{(1)}] - \pi_2 S[P_{(2)}], \tag{6}$$

which is a positive-definite quantity that vanishes iff $P_{(1)} = P_{(2)}$ almost everywhere [16,17].
In the particular case $\pi_1 = \pi_2 = 1/2$ the measure (6) is symmetric. Notice also that



Using the $q$-information measures it is possible to construct a $q$-Kullback-Leibler relative
entropy for two probability distributions. In particular, from the Tsallis entropy the
$q$-Kullback entropy reads

$$K_q[P_{(1)}|P_{(2)}] = -\sum_{j} p_j^{(1)} \ln_q\left(p_j^{(2)}/p_j^{(1)}\right). \tag{7}$$
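The definitions (1) and (3)-(6) can be checked on a small discrete example. The sketch below is only illustrative: the two probability vectors and the function names are our own assumptions, and the Kullback-Leibler helper presumes that the second argument is nonzero wherever the first is (which always holds when the second argument is a mixture containing the first).

```python
import math

def shannon(p):
    """Shannon entropy S[P] of Eq. (1), with the convention 0 ln 0 = 0."""
    return -sum(x * math.log(x) for x in p if x > 0.0)

def kullback(p, q):
    """Kullback-Leibler relative entropy K[P|Q] of Eq. (3)."""
    return sum(x * math.log(x / y) for x, y in zip(p, q) if x > 0.0)

def mix(p, q, w1=0.5, w2=0.5):
    """Weighted mixture w1*P + w2*Q of two probability vectors."""
    return [w1 * x + w2 * y for x, y in zip(p, q)]

def j0(p, q):
    """J_0 of Eq. (4): relative entropy of P with respect to the midpoint."""
    return kullback(p, mix(p, q))

def j1(p, q):
    """Symmetric J_1 of Eq. (5), written as the sum of the two J_0 terms."""
    return j0(p, q) + j0(q, p)

def jsd(p, q, w1, w2):
    """Weighted Jensen-Shannon divergence of Eq. (6)."""
    return shannon(mix(p, q, w1, w2)) - w1 * shannon(p) - w2 * shannon(q)

# Two hypothetical distributions over N = 3 accessible states.
p1 = [0.5, 0.3, 0.2]
p2 = [0.1, 0.6, 0.3]

entropy_form = 2.0 * shannon(mix(p1, p2)) - shannon(p1) - shannon(p2)
print("J_1 (sum of J_0 terms):", j1(p1, p2))
print("J_1 (entropy form):    ", entropy_form)
print("JSD, weights 0.3/0.7:  ", jsd(p1, p2, 0.3, 0.7))
print("JSD of P with itself:  ", jsd(p1, p1, 0.3, 0.7))
```

Both forms of (5) agree to rounding error, and for equal weights the quantity (6) equals one half of $J_1$, as follows directly from comparing (5) and (6).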
IV. PRESENT RESULTS



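Suppose that a quantum state $\psi(x)$ is parameterized, as stated in the Introduction, by
a real parameter that we denote by $\alpha$, and write for simplicity $\psi_\alpha(x) \equiv \psi(\alpha)$. We consider
two states that differ by a change $\alpha \to \alpha + \Delta\alpha$ and expand the second one up to second
order in $\Delta\alpha$ [we set $\Delta^2\alpha = (\Delta\alpha)^2$]:

$$|\psi(\alpha + \Delta\alpha)\rangle = |\psi(\alpha)\rangle + \Delta\alpha\,\frac{d}{d\alpha}|\psi(\alpha)\rangle
 + \frac{1}{2}\Delta^2\alpha\,\frac{d^2}{d\alpha^2}|\psi(\alpha)\rangle + \ldots \tag{8}$$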



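We wish to compare these associated wave functions (say, $\psi_1$ and $\psi_2$), by recourse to different
measures. To this end, and following Eq. (8), we expand $\psi(\alpha + \Delta\alpha)$ up to second order and
assume that both $\psi(\alpha)$ and $\psi(\alpha + \Delta\alpha)$ are properly normalized to unity. For the present
purposes we set $P_{(1)} = |\psi(\alpha)|^2$ and $P_{(2)} = |\psi(\alpha + \Delta\alpha)|^2$. The symmetrized Kullback-Leibler
relative entropy

$$K_S = K[P_{(1)}|P_{(2)}] + K[P_{(2)}|P_{(1)}] \tag{9}$$

is given by

$$K_S = \int dx\,|\psi(\alpha)|^2 \ln\frac{|\psi(\alpha)|^2}{|\psi(\alpha + \Delta\alpha)|^2}
 + \int dx\,|\psi(\alpha + \Delta\alpha)|^2 \ln\frac{|\psi(\alpha + \Delta\alpha)|^2}{|\psi(\alpha)|^2}. \tag{10}$$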



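For simplicity's sake we restrict ourselves to one-dimensional problems (the essentials of our
discourse can already be apprehended at this stage) where, for stationary cases, one can
always assume, without loss of generality, that wave functions are real. Up to order
$\Delta^2\alpha$ ($\psi' \equiv d\psi/d\alpha$),

$$[\psi(\alpha + \Delta\alpha)]^2 = \psi^2\left[1 + \Delta\alpha\,\frac{\partial \ln \psi^2}{\partial\alpha}
 + \Delta^2\alpha\,\frac{\psi'^2 + \psi\psi''}{\psi^2}\right], \tag{11}$$

so that

$$P_{(2)} = P_{(1)}\left[1 + \Delta\alpha\,\frac{\partial \ln P_{(1)}}{\partial\alpha}
 + \frac{\Delta^2\alpha}{2}\,\frac{1}{P_{(1)}}\frac{\partial^2 P_{(1)}}{\partial\alpha^2} + \ldots\right], \tag{12}$$

and we recast the Kullback-Leibler measure in the fashion

$$K_S = -\int dx\,P_{(1)} \ln\left[1 + \Delta\alpha\,\frac{\partial \ln P_{(1)}}{\partial\alpha}
 + \frac{\Delta^2\alpha}{2}\,\frac{1}{P_{(1)}}\frac{\partial^2 P_{(1)}}{\partial\alpha^2}\right]
 + \int dx\,P_{(1)}\left[1 + \Delta\alpha\,\frac{\partial \ln P_{(1)}}{\partial\alpha}
 + \frac{\Delta^2\alpha}{2}\,\frac{1}{P_{(1)}}\frac{\partial^2 P_{(1)}}{\partial\alpha^2}\right]
 \ln\left[1 + \Delta\alpha\,\frac{\partial \ln P_{(1)}}{\partial\alpha}
 + \frac{\Delta^2\alpha}{2}\,\frac{1}{P_{(1)}}\frac{\partial^2 P_{(1)}}{\partial\alpha^2}\right]. \tag{13}$$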



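We will appeal to an expansion of $\ln(1 + y)$ up to first order in $y$. Remember that both $\psi(\alpha)$
and $\psi(\alpha + \Delta\alpha)$ are properly normalized, which implies

$$\Delta\alpha \int dx\,P_{(1)}\,\frac{\partial \ln P_{(1)}}{\partial\alpha} = 0, \tag{14}$$

and

$$(\Delta^2\alpha/2) \int dx\,\frac{\partial^2 P_{(1)}}{\partial\alpha^2} = 0, \tag{15}$$

so that one obtains

$$K_S = \Delta^2\alpha \int dx\,P_{(1)}\left(\frac{\partial \ln P_{(1)}}{\partial\alpha}\right)^2 = \Delta^2\alpha\,I(\alpha), \tag{16}$$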



where $I(\alpha)$ is the Fisher information measure defined in (2). This relation is to be expected
on information-theoretic grounds [13], but, within the present context, it was discussed
neither in [3] nor in [1], so we wish to place strong emphasis on (16). Notice also that the
Kullback-Leibler measure is stable against first-order changes in $\Delta\alpha$ [3].
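A short worked version of the step leading from (13) to (16) may be helpful. It is only a sketch: the shorthand $y$ for the correction inside the bracket of (12) is ours, chosen to match the $\ln(1 + y)$ expansion invoked above.

```latex
% Shorthand (ours): y is the correction inside the bracket of Eq. (12).
y \equiv \Delta\alpha\,\frac{\partial \ln P_{(1)}}{\partial\alpha}
   + \frac{\Delta^2\alpha}{2}\,\frac{1}{P_{(1)}}\frac{\partial^2 P_{(1)}}{\partial\alpha^2},
\qquad P_{(2)} = P_{(1)}\,(1 + y).

% Eq. (13) with ln(1+y) ~ y. The terms linear in y cancel
% (each of them also vanishes separately by Eqs. (14) and (15)).
K_S \simeq -\int dx\,P_{(1)}\,y + \int dx\,P_{(1)}\,(1 + y)\,y
     = \int dx\,P_{(1)}\,y^2
     = \Delta^2\alpha \int dx\,P_{(1)}
       \left(\frac{\partial \ln P_{(1)}}{\partial\alpha}\right)^2 + O(\Delta^3\alpha).
```

The absence of any surviving term linear in $\Delta\alpha$ is what the stability of the Kullback-Leibler measure against first-order changes amounts to in this notation.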
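Let us tackle now a subject that is new for the present scenario: the Jensen-Shannon
divergence (4). Up to second order in $\Delta\alpha$ one has: i) for $J_0$

$$J_0 = \frac{1}{8}\Delta^2\alpha\,I(\alpha), \tag{17}$$

ii) for the symmetric $J_1$ of (5)

$$J_1 = \frac{\Delta^2\alpha}{4} \int dx\,P_{(1)}\left(\frac{\partial \ln P_{(1)}}{\partial\alpha}\right)^2
 = \frac{1}{4}\Delta^2\alpha\,I(\alpha), \tag{18}$$

and iii) for the quantity defined in (6)

$$J^{\pi_1,\pi_2}[P_{(1)}, P_{(2)}] = \frac{\pi_1\pi_2}{2}\,\Delta^2\alpha \int dx\,P_{(1)}
 \left(\frac{\partial \ln P_{(1)}}{\partial\alpha}\right)^2 = \frac{\pi_1\pi_2}{2}\,\Delta^2\alpha\,I(\alpha), \tag{19}$$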



i.e., all the Jensen-Shannon divergences (17), (18), and (19), together with the
Kullback-Leibler measure (16), are proportional to $I$, our main result thus far. Now, it was shown in
[3] that the Euclidean distance between neighboring states can be evaluated as

$$dS_E^2 = \int dx\,[\psi(\alpha + \Delta\alpha) - \psi(\alpha)]^2
 = \Delta^2\alpha \int dx \left(\frac{\partial\psi}{\partial\alpha}\right)^2, \tag{20}$$



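so that it easily follows that, up to second order in $\Delta\alpha$, this distance is also proportional to
$I$, which is well known to be a measure of the gradient of the probability amplitude [13]: for
real $\psi$ with $P_{(1)} = \psi^2$ one has $\partial \ln P_{(1)}/\partial\alpha = 2\psi'/\psi$, whence $I(\alpha) = 4\int dx\,\psi'^2$ and

$$dS_E^2 = \frac{1}{4}\Delta^2\alpha\,I(\alpha), \tag{21}$$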



which in the present context can be regarded as a new result. Additionally, by recourse to
the comparison between $\psi(\alpha)$ and $\psi(\alpha + \Delta\alpha)$, we can relate the Euclidean distance to
the Wootters one, $D_W$. Using the following result from [3],

$$dS_E^2 = 2(1 - \langle\psi(\alpha)|\psi(\alpha + \Delta\alpha)\rangle) = 2(1 - \cos\gamma), \tag{22}$$

where $\gamma$ is a small angle, and now expanding $\cos\gamma$ up to order $\gamma^2$, one obtains

$$dS_E^2 \simeq \gamma^2 = [\arccos(\langle\psi(\alpha)|\psi(\alpha + \Delta\alpha)\rangle)]^2 = dS_W^2 = D_W^2, \tag{23}$$

where $D_W = dS_W$ is the Wootters distance [3], so that Eq. (21) is tantamount to

$$dS_W^2 = \frac{1}{4}\Delta^2\alpha\,I(\alpha), \tag{24}$$

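which can also be regarded as a new result within the current context. We pass now to the
celebrated Fubini-Study metric [19,20]. Considering neighboring states we have [3]

$$dS_{FS}^2 = 1 - |\langle\psi(\alpha)|\psi(\alpha + \Delta\alpha)\rangle|^2, \tag{25}$$

which, up to second order in $\Delta\alpha$, yields

$$dS_{FS}^2 = \frac{1}{4}\Delta^2\alpha\,I(\alpha). \tag{26}$$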



Again, the Euclidean distance, and also the Wootters and Fubini-Study ones, are stable
against first-order changes in $\Delta\alpha$ and all of them coincide, up to order $\Delta^2\alpha$, with the $J_1$
Jensen-Shannon divergence, still another new result. All these distances are proportional to
the concomitant Fisher measure.
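The proportionality of all these distances to $I$ can be illustrated numerically. The following sketch rests on assumptions of ours that are not part of the text: a Gaussian family $P_\alpha(x)$ of mean $\alpha$ and width $\sigma$ (so that $I = 1/\sigma^2$), the real wave function $\psi = \sqrt{P}$, a small illustrative $\Delta\alpha$, and a plain midpoint-rule quadrature. It prints each distance divided by $\Delta^2\alpha\,I(\alpha)$.

```python
import math

def gaussian(x, mean, sigma):
    """Gaussian density with the given mean and width (assumed family)."""
    z = (x - mean) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))

def integrate(f, lo=-10.0, hi=10.0, n=4000):
    """Midpoint-rule quadrature of f over [lo, hi]."""
    dx = (hi - lo) / n
    return dx * sum(f(lo + (k + 0.5) * dx) for k in range(n))

alpha, dalpha, sigma = 0.0, 0.02, 1.0   # illustrative values
w1, w2 = 0.3, 0.7                        # weights pi_1, pi_2 of Eq. (6)

def P1(x):
    return gaussian(x, alpha, sigma)

def P2(x):
    return gaussian(x, alpha + dalpha, sigma)

def psi1(x):
    return math.sqrt(P1(x))

def psi2(x):
    return math.sqrt(P2(x))

def kl(p, q):
    """Kullback-Leibler relative entropy for densities on the grid."""
    return integrate(lambda x: p(x) * math.log(p(x) / q(x)))

def entropy(p):
    """Continuous analogue of the Shannon measure (1)."""
    return -integrate(lambda x: p(x) * math.log(p(x)))

def mixture(a, b):
    return lambda x: a * P1(x) + b * P2(x)

fisher = 1.0 / sigma**2          # exact Fisher information of this family
scale = dalpha**2 * fisher       # Delta^2 alpha * I(alpha)

half = mixture(0.5, 0.5)
j0_12 = kl(P1, half)
j0_21 = kl(P2, half)
overlap = min(1.0, integrate(lambda x: psi1(x) * psi2(x)))

ratios = {
    "K_S": (kl(P1, P2) + kl(P2, P1)) / scale,                   # Eq. (16): 1
    "J_0": j0_12 / scale,                                        # Eq. (17): 1/8
    "J_1": (j0_12 + j0_21) / scale,                              # Eq. (18): 1/4
    "JSD": (entropy(mixture(w1, w2)) - w1 * entropy(P1)
            - w2 * entropy(P2)) / scale,                         # Eq. (19): w1*w2/2
    "Euclid": integrate(lambda x: (psi2(x) - psi1(x)) ** 2) / scale,  # Eq. (21): 1/4
    "Wootters": math.acos(overlap) ** 2 / scale,                 # Eq. (24): 1/4
    "FubiniStudy": (1.0 - overlap ** 2) / scale,                 # Eq. (26): 1/4
}

for name, value in ratios.items():
    print(f"{name:12s} / (Delta^2 alpha * I) = {value:.5f}")
```

Shrinking `dalpha` should bring the printed ratios closer to the second-order coefficients, since the neglected terms are of higher order in $\Delta\alpha$.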



V. DISCUSSION

Let us examine a bit more closely the problem of estimating a single parameter ($\alpha$) of
a system or phenomenon from knowledge of some measurements of the variable $x$ [13].
Consider that we have at our disposal $N$ data values of this variable, $x_1, \ldots, x_N \equiv \mathbf{x}$.
The system or phenomenon is governed by the conditional probability law (likelihood law)
$f(\mathbf{x}|\alpha) = f_\alpha(\mathbf{x})$. The data obey

$$\mathbf{x} = \alpha + \mathbf{y} \quad (\mathbf{y}: \text{added noise values}), \tag{27}$$

with $\mathbf{y}$ assumed to be intrinsic to the parameter $\alpha$ under measurement. As an example, $\alpha$
could be a particle's position and $\mathbf{y}$ its concomitant fluctuations. The system consisting of
quantities $\alpha$, $\mathbf{y}$, $\mathbf{x}$ is a closed one [13]. The data are used in an estimation principle to form
an estimate $\tilde\alpha$ of $\alpha$ which is a function of all the data (say, $\sum_{i=1}^{N} x_i/N$), and one assumes
that the overall measurement procedure is "smart" in the sense that $\tilde\alpha$ is on average a
better estimate of $\alpha$ than any of the data observables [13]. We see then that, on account of
the Cramér-Rao bound, $I$ carries information with regard to intrinsic uncertainties, which,
quantum mechanically, correspond to intrinsic fluctuations [13]. It is in this light that we
have to regard the results of the preceding Section.

The metric structure of the manifold $\mathcal{K}$ is completely expressed by the uncertainties and
correlations of Hermitian operators generating various evolutions of a given quantum state
[8] which are neatly captured by the Fubini-Study metric, as has been demonstrated by using
squeezed states [21]. But this metric does coincide, up to second order, with all the others
considered here, and with $D_W$ in particular, that gave rise to Wootters' suggestion
mentioned in the Introduction: statistical fluctuations in the outcomes of measurements
might be partly responsible for the Hilbert-space structure of quantum mechanics. This view
is now considerably strengthened in discovering that all distances (here considered) between
neighboring quantum states, whether of statistical or Hilbert-metric origin, are proportional
to Fisher's measure, up to second-order approximation. Now,

• since $I$ captures, as pointed out above, the essentially fluctuating nature of the variables
$x$ on which the state $\psi_\alpha(x)$ depends, and

• distances between neighboring states are proportional to $I$, it follows that

• Wootters' viewpoint receives yet further (independent) reconfirmation.